LINEAR SUFFICIENCY AND SOME APPLICATIONS
IN  MULTILINEAR  ESTIMATION 

Hilmar  Drygas 

1.  Linearly  sufficient  statistics  in  linear  models 

The concept of linear sufficiency goes back to work by Baksalary
and Kala [2], Drygas [6] and J. Muller [11]. Since it is needed in
a coordinate-free form we will give it here in this form. As usual
a linear model is described by a statistical field
$(\Omega, \mathcal{F}, \mathcal{P})$ and a random $H$-valued vector $y$,
$H$ an inner-product space, such that

(1.1)  $E_P\, y \in L \quad \forall P \in \mathcal{P}$

(1.2)  $\mathrm{Cov}_P\, y \in \mathcal{Q} \quad \forall P \in \mathcal{P}$

This setup is also called the model $M(L, \mathcal{Q})$. $L$ will in
general be a linear manifold and $\mathcal{Q}$ a cone of n.n.d. matrices
(or operators). In this paper we will only be concerned with the case
$\mathcal{Q} = \{\sigma^2 Q : \sigma^2 > 0\}$.

If the model $M(L, \mathcal{Q})$ is given then a linear inhomogeneous
transformation $d + Gy$, $G$ a linear mapping from $H$ to $H$, is called
BLUE (best linear unbiased estimator) of $Ey$ if it is unbiased and
has smallest covariance-matrix (covariance-operator) in the class
of all linear unbiased estimators of $Ey$. $d + Gy$ is BLUE of $Ey$ iff

(i) $d = (I - G)l \ \ \forall l \in L$, (ii) $Gf = f \ \ \forall f \in F = L - L$ and
(iii) $GQx = 0 \ \ \forall x \in F^{\perp},\ \forall Q \in \mathcal{Q}$.

A BLUE need not exist, but it exists in the case
$\mathcal{Q} = \{\sigma^2 Q;\ \sigma^2 > 0\}$ since $F \cap QF^{\perp} = \{0\}$.
(See e.g., Drygas [5]).

1.1 Definition: Let $A_0 y = c + Ay$. Then $A_0 y$ is called linearly
sufficient if there is a BLUE of $Ey$ which is a linear function
of $A_0 y$.

1.2 Theorem: $A_0 y$ is linearly sufficient if and only if
$F \subseteq \operatorname{im}(WA^*)$, where $W = Q + cP_F$ is such that
$c \geq 0$ and $F \subseteq \operatorname{im}(W)$. ($P_F$ is the orthogonal
projection onto $F$, $A^*$ is the adjoint mapping of $A$).

Proof: 1. First assume that $F \subseteq \operatorname{im}(WA^*)$. We
consider the equation $BAP_F = P_F$. We claim that this equation
possesses a solution. This equation is equivalent to
$P_F = P_F A^* B^*$ or $F \subseteq \operatorname{im}(P_F A^*)$, which again
is equivalent to $(AP_F)^{-1}(0) \subseteq F^{\perp}$. Therefore let
$AP_F x = 0$; then $P_F x = WA^* b$ for some $b$ and
$AP_F x = AWA^* b = 0$, implying $WA^* b = P_F x = 0$.

Now let $Hz$ be a BLUE of $Ez$ in the model $M(AF, AQA^*)$. Then
for $l \in L$

(1.3)  $(I - BHA)l + BHAy = (I - BHA)l - BHc + BHA_0 y$

is BLUE of $Ey$, if $BAP_F = P_F$. Indeed, if $l \in L$ and $y = Qw$,
$w \in F^{\perp}$, then $AQw = AWw = AWA^* v$ for some $v$. Since
$F \subseteq \operatorname{im}(WA^*)$ is equivalent to
$(AW)^{-1}(0) \subseteq F^{\perp}$, $AW(w - A^* v) = 0$ implies
$w - A^* v \in F^{\perp}$, i.e., $A^* v \in F^{\perp}$ or
$v \in (A^*)^{-1}(F^{\perp}) = (AF)^{\perp}$. Thus $HAWA^* v = 0$. Thus

(1.4)  $(I - BHA)l - BHc + BHA_0(l + Qw) = l$,

proving the BLUE-property.

2. Let $G_0 A_0 y + d$ be BLUE of $Ey$ in $M(L, \{Q\})$. Then

(1.5)  $G_0 A f = f \ \ \forall f \in F = L - L, \qquad G_0 A Q F^{\perp} = 0$.

We show $(AW)^{-1}(0) \subseteq F^{\perp}$, which is equivalent to
$F \subseteq \operatorname{im}(WA^*)$. Let $AWt = 0$, where $Wt = Qa + f$,
$a \in F^{\perp}$, $f \in F$. Then $G_0 A W t = 0 = f$, implying
$Wt = Qa = Wa$, $(t - a) \in W^{-1}(0) \subseteq F^{\perp}$ (since
$F \subseteq \operatorname{im} W$). Thus $t = a + (t - a) \in F^{\perp}$.

Q.E.D.
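The criterion of Theorem 1.2 can be checked numerically in the coordinate case $H = \mathbb{R}^n$. The following is a minimal sketch, not part of the original argument: it assumes $F = \operatorname{im}(X)$ for a design matrix `X`, takes $c = 1$ (which always yields $F \subseteq \operatorname{im}(W)$ for n.n.d. $Q$), and tests the inclusion $F \subseteq \operatorname{im}(WA^*)$ by comparing ranks. The function name and the example matrices are hypothetical.

```python
import numpy as np


def is_linearly_sufficient(A, X, Q, c=1.0, tol=1e-9):
    """Check the criterion F ⊆ im(W A*) of Theorem 1.2 for F = im(X)."""
    P_F = X @ np.linalg.pinv(X)        # orthogonal projection onto F
    W = Q + c * P_F                    # c > 0 gives F ⊆ im(W) for n.n.d. Q
    WAt = W @ A.T                      # in R^n the adjoint A* is the transpose
    rank_WAt = np.linalg.matrix_rank(WAt, tol=tol)
    # F ⊆ im(W A*) iff adjoining the columns of X leaves the rank unchanged.
    rank_joint = np.linalg.matrix_rank(np.hstack([WAt, X]), tol=tol)
    return bool(rank_joint == rank_WAt)


# Hypothetical example: straight-line regression with Q = I_4.
X = np.array([[1.0, 0.0], [1.0, 1.0], [1.0, 2.0], [1.0, 3.0]])
Q = np.eye(4)
A_normal = X.T                # statistic X'y
A_first_two = np.eye(4)[:2]   # statistic (y_1, y_2)'

sufficient_normal = is_linearly_sufficient(A_normal, X, Q)
sufficient_first_two = is_linearly_sufficient(A_first_two, X, Q)
```

In this example $X'y$ passes the test, in line with the fact that the least-squares estimator is a function of $X'y$, while the first two observations alone do not.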

1.3 Definition: Let $z = A_0 y$ be linearly sufficient. Then $z$ is
called linearly minimal sufficient if for any $z_1 = A_1 y$ which is
linearly sufficient, there exists a linear transformation mapping
$z_1$ to $z$ almost surely mod $\mathcal{P}$.

1.4 Theorem: $z = A_0 y = c + Ay$ is linearly minimal sufficient if
and only if

(1.5)  $F = \operatorname{im}(WA^*)$.


The  proof  goes  along  the  lines  of  a  similar  proof  in  Drygas 


2. Computation of expectation and covariance for multilinear expressions

In this paragraph we are assuming that
$\varepsilon_1, \varepsilon_2, \ldots, \varepsilon_n$ are independent (at
least up to a required order concerning the computation of moments)
random variables with expectation zero and existing moments up to some
required order. The moments $E(\varepsilon_i^{\nu})$ are assumed to be
equal for all $i$. Thus $\varepsilon_1, \ldots, \varepsilon_n$ behave, at
least as far as the moments up to a certain order are concerned, as
independently identically distributed random variables.

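Let $\varepsilon = (\varepsilon_1, \ldots, \varepsilon_n)'$ and $A$ be a
symmetric $n \times n$-matrix. Then

(2.1)  $E(\varepsilon' A \varepsilon) = E(\operatorname{tr}(A \varepsilon\varepsilon')) = \operatorname{tr}(A \sigma^2 I_n) = \sigma^2 \operatorname{tr} A$.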

The computation of $E(\varepsilon' A \varepsilon)^2$ or
$\operatorname{Var}(\varepsilon' A \varepsilon)$ is tedious, but it is
usually considered as "elementary and straightforward". However,
in recent years attempts have been made to make such computations
more efficient. We mention in this context mainly the paper
by J. Kleffe [10], who has elaborated an approach originally
adopted by Balestra [3] and Neudecker [12].

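Let $A = (a_{ij};\ i,j = 1, 2, \ldots, n)$. Then for computing
$E(\varepsilon' A \varepsilon)^2$ evidently

(2.2)  $E\Big(\big(\sum_{i,j} a_{ij} \varepsilon_i \varepsilon_j\big)\big(\sum_{k,l} a_{kl} \varepsilon_k \varepsilon_l\big)\Big) = \sum_{i,j,k,l} a_{ij} a_{kl}\, E(\varepsilon_i \varepsilon_j \varepsilon_k \varepsilon_l)$

is needed. Since $(a_{ij} a_{kl}) = A \otimes A$, where $\otimes$ denotes the
Kronecker-product $A \otimes B = (a_{ij} B)$, evidently

(2.3)  $E(\varepsilon' A \varepsilon)^2 = \operatorname{tr}\big((A \otimes A)\, E(\varepsilon\varepsilon' \otimes \varepsilon\varepsilon')\big)$.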

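This formula has been obtained by Balestra [3], Neudecker [12] and
Kleffe [10] via a different technique. However, the formula (2.3)
does not yet help very much. Let us therefore rewrite (2.2), (2.3)
in the form

(2.3a)  $E(\varepsilon' A \varepsilon)^2 = \sum_{i,j} a_{ij} \Big(\sum_{k,l} a_{kl}\, E(\varepsilon_k \varepsilon_l \varepsilon_i \varepsilon_j)\Big)$.

If we denote the $n \times n$-matrix
$E(\varepsilon_k \varepsilon_l \varepsilon\varepsilon') = (E(\varepsilon_k \varepsilon_l \varepsilon_i \varepsilon_j))$
by $\psi_{kl}$, then evidently

(2.4)  $E(\varepsilon' A \varepsilon)^2 = \operatorname{tr}\big(A \cdot \sum_{k,l} a_{kl} \psi_{kl}\big)$.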


This is the formula obtained by Kleffe [10]. If $k = l$, then by
independence
$\psi_{kl} = \psi_{kk} = E(\varepsilon_k^4)\, e_{kk} + \sigma^4 \sum_{i \neq k} e_{ii}$,
if we denote the matrix $e_i e_j'$ ($e_i$ the $i$-th unit-vector in
$\mathbb{R}^n$) by $e_{ij}$. Let $E(\varepsilon_i^4) = \beta \sigma^4$. Then

(2.5)  $\psi_{kk} = \sigma^4 \big((\beta - 1)\, e_{kk} + I_n\big)$

is obtained. Similarly we get for $k \neq l$, that by independence

(2.6)  $\psi_{kl} = \sigma^4 (e_{kl} + e_{lk})$.


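Finally by symmetry of $A$

(2.7)  $\sum_{k,l} a_{kl} \psi_{kl} = \sigma^4 \Big\{ \sum_{k} \big((\beta - 1)\, a_{kk} e_{kk} + a_{kk} I_n\big) + 2 \sum_{k \neq l} a_{kl} e_{kl} \Big\}$
       $= \sigma^4 \{ (\operatorname{tr} A)\, I_n + (\beta - 3) \operatorname{diag} A + 2A \}$,

where $\operatorname{tr} A = \sum_{k=1}^{n} a_{kk}$ and
$\operatorname{diag} A$ is the diagonal matrix with the same diagonal
elements as $A$. Since
$(\operatorname{tr} A)^2 = \operatorname{tr}((\operatorname{tr} A) I_n \cdot A)$,
evidently

(2.8)  $(\operatorname{Cov} \varepsilon\varepsilon') A = \sigma^4 \{ 2A + (\beta - 3) \operatorname{diag} A \}$,

as is well-known (Hsu [8], Drygas [4]).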
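The moment formulas (2.1) and (2.8) can be checked exactly on a small example. The sketch below is only an illustration under stated assumptions: the errors are taken i.i.d. from a hypothetical three-point distribution on $\{-2, 0, 2\}$, the matrix `A` is an arbitrary symmetric example, and the expectation is computed by enumerating all outcomes. It compares the enumerated mean and variance of $\varepsilon' A \varepsilon$ with $\sigma^2 \operatorname{tr} A$ and $\sigma^4 \operatorname{tr}([2A + (\beta-3)\operatorname{diag} A] A)$.

```python
import itertools
import numpy as np


def quad_form_moments(A, values, probs):
    """Exact mean and variance of e'Ae for i.i.d. discrete errors e_i."""
    n = A.shape[0]
    mean = 0.0
    second = 0.0
    for idx in itertools.product(range(len(values)), repeat=n):
        e = values[list(idx)]              # one outcome of the error vector
        p = np.prod(probs[list(idx)])      # its probability (independence)
        q = e @ A @ e
        mean += p * q
        second += p * q * q
    return mean, second - mean ** 2


# Assumed error distribution: mean zero, sigma^2 = 2, beta = E(e^4)/sigma^4 = 2.
values = np.array([-2.0, 0.0, 2.0])
probs = np.array([0.25, 0.5, 0.25])
sigma2 = probs @ values ** 2
beta = (probs @ values ** 4) / sigma2 ** 2

A = np.array([[2.0, 1.0, 0.0],
              [1.0, -1.0, 3.0],
              [0.0, 3.0, 1.0]])

mean, var = quad_form_moments(A, values, probs)
mean_formula = sigma2 * np.trace(A)                                   # (2.1)
var_formula = sigma2 ** 2 * np.trace(
    (2 * A + (beta - 3) * np.diag(np.diag(A))) @ A)                   # from (2.8)
```

Because the distribution is discrete, the comparison involves no sampling error, only floating-point rounding.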


The method developed by [...] the computation of variances [...].
We consider the $\mathbb{R}^{n^p}$ as the set of all
$(a_{i_1 \ldots i_p};\ i_1, \ldots, i_p = 1, \ldots, n)$, [...] ordered.
An element $a = (a_{i_1 \ldots i_p})$ is called a tensor. If
$b \in \mathbb{R}^n$, then $b^{\otimes p}$ is a $p$-fold tensor with elements
$b_{i_1 \ldots i_p} = b_{i_1} b_{i_2} \cdots b_{i_p}$. In $\mathbb{R}^{n^p}$
we introduce the usual inner product

(2.9)  $\langle a, b \rangle = \sum_{i_1, \ldots, i_p = 1}^{n} a_{i_1 \ldots i_p}\, b_{i_1 \ldots i_p}$.


A tensor $a = (a_{i_1 \ldots i_p})$ is called symmetric, if

(2.10)  $a_{\pi(i_1) \ldots \pi(i_p)} = a_{i_1 \ldots i_p}$

for any permutation $\pi \in S_p$. Evidently $b^{\otimes p}$ is a symmetric
tensor. The projection on the set of symmetric tensors is given by the
symmetrizer $\pi_S$:

(2.11)  $(\pi_S a)_{i_1 \ldots i_p} = (p!)^{-1} \sum_{\pi \in S_p} a_{\pi(i_1) \pi(i_2) \ldots \pi(i_p)}$.
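As an illustration of the symmetrizer, the following minimal sketch stores a $p$-fold tensor as a NumPy array with $p$ axes of length $n$ and averages over all axis permutations. The array representation and the function name `symmetrize` are assumptions made for this example. The checks below confirm that the map is idempotent, self-adjoint with respect to the inner product (2.9), and leaves $b^{\otimes 3}$ fixed.

```python
import itertools
import math
import numpy as np


def symmetrize(a):
    """Average a tensor over all permutations of its axes (the map pi_S)."""
    p = a.ndim
    perms = itertools.permutations(range(p))
    return sum(np.transpose(a, perm) for perm in perms) / math.factorial(p)


rng = np.random.default_rng(0)
a = rng.normal(size=(3, 3, 3))
c = rng.normal(size=(3, 3, 3))
b = rng.normal(size=3)
b3 = np.einsum('i,j,k->ijk', b, b, b)     # the tensor power b^{(x)3}

idempotent = np.allclose(symmetrize(symmetrize(a)), symmetrize(a))
self_adjoint = np.isclose(np.sum(symmetrize(a) * c), np.sum(a * symmetrize(c)))
fixes_power = np.allclose(symmetrize(b3), b3)
```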

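We consider $\varepsilon^{\otimes 3}$ and $\varepsilon^{\otimes 4}$. Evidently

(2.12)  $E \langle a, \varepsilon^{\otimes 3} \rangle = \big(\sum_{i=1}^{n} a_{iii}\big)\, E(\varepsilon_1^3)$.

Similarly, if $a$ is symmetric

(2.13)  $E(\langle a, \varepsilon^{\otimes 4} \rangle) = \sum_{i,j,k,l} a_{ijkl}\, E(\varepsilon_i \varepsilon_j \varepsilon_k \varepsilon_l) = \sigma^4 (\beta - 3) \sum_{i} a_{iiii} + 3 \sigma^4 \sum_{i,j} a_{iijj}$,

where again $E(\varepsilon_i^4) = \beta \sigma^4$. Since
$\langle a, \varepsilon^{\otimes 4} \rangle = \langle a, \pi_S \varepsilon^{\otimes 4} \rangle = \langle \varepsilon^{\otimes 4}, \pi_S a \rangle$
the restriction to symmetric $a$ is not essential. We will come
back to this at the end of the paragraph.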
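Formula (2.13) can be verified exactly in a small case. The sketch below is an illustration under assumptions not taken from the text: a hypothetical three-point error distribution on $\{-2, 0, 2\}$ (so $\sigma^2 = 2$, $\beta = 2$), $n = 3$, and a randomly generated symmetric tensor `a`. The expectation is obtained by enumerating all outcomes.

```python
import itertools
import math
import numpy as np

n = 3
rng = np.random.default_rng(0)
raw = rng.normal(size=(n, n, n, n))
# Symmetrize the random tensor by averaging over all axis permutations.
a = sum(np.transpose(raw, perm)
        for perm in itertools.permutations(range(4))) / math.factorial(4)

# Assumed i.i.d. error distribution with mean zero.
values = np.array([-2.0, 0.0, 2.0])
probs = np.array([0.25, 0.5, 0.25])
sigma2 = probs @ values ** 2
beta = (probs @ values ** 4) / sigma2 ** 2

# Exact expectation of <a, e^{(x)4}> by enumeration of all 3^n outcomes.
expectation = 0.0
for idx in itertools.product(range(len(values)), repeat=n):
    e = values[list(idx)]
    p = np.prod(probs[list(idx)])
    expectation += p * np.einsum('ijkl,i,j,k,l->', a, e, e, e, e)

# Right-hand side of (2.13).
formula = sigma2 ** 2 * ((beta - 3) * np.einsum('iiii->', a)
                         + 3 * np.einsum('iijj->', a))
```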

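As far as the computation of the covariance-operator of
$\varepsilon^{\otimes 3}$ and $\varepsilon^{\otimes 4}$ is concerned, it is
hardly possible to get simple expressions without additional
assumptions. Therefore we will assume in the sequel that $\varepsilon$
is quasi-normally distributed, i.e., that the moments up to order 6
and 8, respectively, coincide with the normal moments. This means that
for $p = 3$ we assume that
$E(\varepsilon_i^4) = E(\varepsilon_1^4) = 3\sigma^4$,
$E(\varepsilon_i^6) = E(\varepsilon_1^6) = 15\sigma^6$, and for $p = 4$
additionally $E(\varepsilon_i^8) = E(\varepsilon_1^8) = 105\sigma^8$ is
assumed to hold. To compute
$E(\langle a, \varepsilon^{\otimes p} \rangle^2)$ evidently

(2.14)  $\sum_{i_1, \ldots, i_p, j_1, \ldots, j_p = 1}^{n} a_{i_1 \ldots i_p}\, a_{j_1 \ldots j_p}\, E(\varepsilon_{i_1} \varepsilon_{i_2} \cdots \varepsilon_{i_p} \varepsilon_{j_1} \cdots \varepsilon_{j_p})$

has to be computed. This may also be written as
$\langle a, Va \rangle$, where $V$ is some operator. Evidently (2.14) is
equal to

(2.15)  $\sum_{i_1, \ldots, i_p = 1}^{n} a_{i_1 \ldots i_p} \Big( \sum_{j_1, \ldots, j_p = 1}^{n} a_{j_1 \ldots j_p}\, E(\varepsilon_{j_1} \cdots \varepsilon_{j_p} \varepsilon_{i_1} \cdots \varepsilon_{i_p}) \Big)$,

implying that

(2.16)  $Va = \sum_{j_1, \ldots, j_p = 1}^{n} a_{j_1 \ldots j_p}\, \psi_{j_1 \ldots j_p}$,

where $\psi_{j_1 \ldots j_p} = E(\varepsilon_{j_1} \cdots \varepsilon_{j_p}\, \varepsilon^{\otimes p})$.

We will compute $Va$ for $p = 3$ via formula (2.16) and for $p = 4$
by just computing
$E(\varepsilon_{j_1} \cdots \varepsilon_{j_p} \varepsilon_{i_1} \cdots \varepsilon_{i_p})$.
This will allow a comparison of the two methods.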

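To do the computations for $p = 3$ evidently
$\psi_{\alpha\beta\gamma}$, $\psi_{\alpha\alpha\beta}$ and
$\psi_{\alpha\alpha\alpha}$ ($\alpha \neq \beta \neq \gamma$) have to be
considered. We denote by $e_{\alpha\beta\gamma}$ the tensor having a 1
at place $(\alpha, \beta, \gamma)$ and zero elsewhere. Let, moreover,
$I_\beta = \sum_{\alpha=1}^{n} e_{\alpha\alpha\beta}$.

Then we get for symmetric $a$:

(2.17)  $\psi_{\alpha\beta\gamma} = \sigma^6 \sum_{\pi \in S_3} e_{\pi(\alpha)\pi(\beta)\pi(\gamma)} = 6\sigma^6\, \pi_S e_{\alpha\beta\gamma}$

(2.18)  $\psi_{\alpha\alpha\beta} = 3\sigma^6 e_{\beta\beta\beta} + \sigma^6 \sum_{\delta \neq \alpha, \beta} (e_{\delta\delta\beta} + e_{\delta\beta\delta} + e_{\beta\delta\delta}) + 3\sigma^6 (e_{\alpha\alpha\beta} + e_{\alpha\beta\alpha} + e_{\beta\alpha\alpha})$
        $= 3\sigma^6\, \pi_S I_\beta + 6\sigma^6\, \pi_S e_{\alpha\alpha\beta}$

(2.19)  $\psi_{\alpha\alpha\alpha} = 15\sigma^6 e_{\alpha\alpha\alpha} + 3\sigma^6 \sum_{\delta \neq \alpha} (e_{\delta\delta\alpha} + e_{\delta\alpha\delta} + e_{\alpha\delta\delta})$
        $= 6\sigma^6 e_{\alpha\alpha\alpha} + 9\sigma^6\, \pi_S I_\alpha$

Finally

(2.20)  $Va = 6\sigma^6 \sum_{\alpha \neq \beta \neq \gamma} a_{\alpha\beta\gamma}\, \pi_S e_{\alpha\beta\gamma} + 9\sigma^6 \sum_{\alpha \neq \beta} a_{\alpha\alpha\beta}\, \pi_S I_\beta + 6\sigma^6 \sum_{\alpha \neq \beta} a_{\alpha\alpha\beta} (e_{\alpha\alpha\beta} + e_{\alpha\beta\alpha} + e_{\beta\alpha\alpha})$
        $\quad + 6\sigma^6 \sum_{\alpha} a_{\alpha\alpha\alpha} e_{\alpha\alpha\alpha} + 9\sigma^6 \sum_{\alpha} a_{\alpha\alpha\alpha}\, \pi_S I_\alpha$
        $= \sigma^6 \Big\{ 6 \sum_{\alpha, \beta, \gamma} a_{\alpha\beta\gamma} e_{\alpha\beta\gamma} + 9 \sum_{\alpha, \beta} a_{\alpha\alpha\beta}\, \pi_S I_\beta \Big\}$
        $= \sigma^6 \Big\{ 6a + 9 \sum_{\beta} (\operatorname{tr}_\beta a)\, \pi_S I_\beta \Big\}$,

where $\operatorname{tr}_\beta a = \sum_{\alpha} a_{\alpha\alpha\beta}$.
Evidently also

(2.21)  $Va = \sigma^6 \Big\{ 6a + 9 \sum_{\beta} (\pi_S I_\beta \circ \pi_S I_\beta)\, a \Big\}$,

where $a \circ b$ denotes the outer product defined by
$(a \circ b) c = \langle b, c \rangle a$.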


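This follows from $\langle a, \pi_S I_\beta \rangle = \operatorname{tr}_\beta a$,
if $a$ is symmetric. It is also true that

(2.22)  $\sum_{\beta=1}^{n} (\pi_S I_\beta \circ \pi_S I_\beta) = \pi_S \big(I_n \otimes \operatorname{vec}(I_n)(\operatorname{vec} I_n)'\big) \pi_S$,

the representation found by Pukelsheim in [14]. This representation
will not be used in this paper.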
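The quadratic form $\langle a, Va \rangle = \sigma^6 \{ 6 \langle a, a \rangle + 9 \sum_\beta (\operatorname{tr}_\beta a)^2 \}$ implied by the formula for $Va$ with $p = 3$ can be checked exactly. The sketch below rests on an assumption made only for this illustration: quasi-normal errors are realised by a discrete distribution on the nodes of a 4-point Gauss-Hermite rule, which reproduces the normal moments up to order 7. The tensor `a`, the value of `sigma` and $n = 3$ are arbitrary choices.

```python
import itertools
import math
import numpy as np
from numpy.polynomial.hermite_e import hermegauss

sigma = 1.5
nodes, weights = hermegauss(4)       # weight exp(-x^2/2); exact up to degree 7
values = sigma * nodes               # support of the assumed error distribution
probs = weights / weights.sum()      # normalised weights act as probabilities

n = 3
rng = np.random.default_rng(1)
raw = rng.normal(size=(n, n, n))
a = sum(np.transpose(raw, perm)
        for perm in itertools.permutations(range(3))) / math.factorial(3)

# Enumerate all outcomes of (e_1, ..., e_n) and their probabilities.
states = np.array(list(itertools.product(range(len(values)), repeat=n)))
E = values[states]                   # shape (4^n, n)
P = probs[states].prod(axis=1)

q = np.einsum('ijk,si,sj,sk->s', a, E, E, E)    # <a, e^{(x)3}> per outcome
second_moment = P @ (q * q)                     # E(<a, e^{(x)3}>^2)

tr = np.einsum('iib->b', a)                     # tr_beta a = sum_alpha a_{alpha alpha beta}
formula = sigma ** 6 * (6 * np.sum(a * a) + 9 * np.sum(tr * tr))
```

Since the integrand has degree at most 6 in each coordinate, the product rule gives the normal-theory value without sampling error.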


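We now assert that for symmetric $a$

(2.23)  $E(\langle a, \varepsilon^{\otimes 4} \rangle^2) = \sum_{i,j,k,l,r,s,t,u=1}^{n} a_{ijkl}\, a_{rstu}\, E(\varepsilon_i \varepsilon_j \varepsilon_k \varepsilon_l \varepsilon_r \varepsilon_s \varepsilon_t \varepsilon_u)$
        $= \sigma^8 \Big\{ 24 \sum_{i,j,k,l=1}^{n} a_{ijkl}^2 + 72 \sum_{i,j,k,l=1}^{n} a_{iijk}\, a_{jkll} + 9 \sum_{i,j,k,l=1}^{n} a_{iijj}\, a_{kkll} \Big\}$.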

Indeed, under quasi-normality,
$E(\varepsilon_i \varepsilon_j \cdots \varepsilon_u)$ vanishes if some
$\varepsilon_\alpha$ appears an odd number of times. Therefore only the
cases $\varepsilon_i^8$, $\varepsilon_i^6 \varepsilon_j^2$,
$\varepsilon_i^4 \varepsilon_j^4$, $\varepsilon_i^4 \varepsilon_j^2 \varepsilon_k^2$
and $\varepsilon_i^2 \varepsilon_j^2 \varepsilon_k^2 \varepsilon_l^2$ are to be
considered. If all indices $i, j, k, l$ are different from each other
then surely the sums reported in (2.23) will appear. The factors
24, 72 and 9 arise from careful combinatorial considerations and
the fact that $a$ is symmetric. Note that some combinations are
covered by the summation. If $i = j = k = l$, then the subsum in
(2.23) is equal to

(2.24)  $105\, \sigma^8 \sum_{i=1}^{n} a_{iiii}^2$.


Now consider the 6 cases $i = j$, $i = k$, $i = l$, $j = k$, $j = l$ and
$k = l$. Then by symmetry the corresponding subsum in (2.23) is equal to

(2.25)  $\sigma^8 \{ 18 \sum a_{iiii} a_{jjkk} + 288 \sum a_{iiij} a_{ijkk} + 216 \sum a_{iijk}^2 + 108 \sum a_{iijj} a_{iikk} \}$
        $= 3\sigma^8 \{ 6 \sum a_{iiii} a_{jjkk} + 96 \sum a_{iiij} a_{ijkk} + 72 \sum a_{iijk} a_{iijk} + 36 \sum a_{iijj} a_{iikk} \}$.

In view of $E(\varepsilon_i^4) = 3\sigma^4$ and symmetry this is just the
set of all possible summands occurring with factor
$E(\varepsilon_i^4 \varepsilon_j^2 \varepsilon_k^2)$. Again careful
considerations are necessary to establish the combinatorial
numbers 6, 96, 72 and 36. Finally we get for the seven cases
$i = j = k$; $i = j = l$; $i = k = l$; $j = k = l$; $i = j$, $k = l$;
$i = k$, $j = l$ and $i = l$, $j = k$ as subsum of (2.23)

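(2.26)  $\sigma^8 \{ 180 \sum a_{iiii} a_{iijj} + 240 \sum a_{iiij}^2 + 9 \sum a_{iiii} a_{jjjj} + 144 \sum a_{iiij} a_{ijjj} + 162 \sum a_{iijj}^2 \}$
        $= 15\sigma^8 \{ 12 \sum a_{iiii} a_{iijj} + 16 \sum a_{iiij}^2 \} + 9\sigma^8 \{ \sum a_{iiii} a_{jjjj} + 16 \sum a_{iiij} a_{ijjj} + 18 \sum a_{iijj}^2 \}$.

In view of $E(\varepsilon_i^6) = 15\sigma^6$, $E(\varepsilon_i^4) = 3\sigma^4$
the first sum belongs, due to symmetry of $a$, to all terms where
$E(\varepsilon_i^6 \varepsilon_j^2)$ appears. The second sum belongs to all
terms where $E(\varepsilon_i^4 \varepsilon_j^4)$ occurs. Again, careful
reasoning is necessary to determine the combinatorial factors 12,
16, 1, 16 and 18.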

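Since the last term in (2.23) is evidently equal to
$9\sigma^8 (\sum a_{iijj})^2 = [E(\langle a, \varepsilon^{\otimes 4} \rangle)]^2$
it follows that for symmetric $a$:

(2.27)  $\operatorname{Var}(\langle a, \varepsilon^{\otimes 4} \rangle) = \sigma^8 \{ 24 \sum a_{ijkl}^2 + 72 \sum a_{iijk}\, a_{jkll} \}$.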


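Define the tensor $e_{\alpha\beta\gamma\delta}$, which has unity at place
$(\alpha, \beta, \gamma, \delta)$ and zero elsewhere. Let
$I_{jk} = \sum_{\alpha=1}^{n} e_{jk\alpha\alpha}$ and
$\operatorname{tr}_{jk} a = \sum_{\alpha=1}^{n} a_{jk\alpha\alpha}$. Then
evidently

(2.28)  $\operatorname{Var}(\langle a, \varepsilon^{\otimes 4} \rangle) = \sigma^8 \Big\{ \Big\langle a,\ 24a + 72 \sum_{j,k=1}^{n} (\operatorname{tr}_{jk} a)\, \pi_S I_{jk} \Big\rangle \Big\}$.

This shows that for symmetric $a$ evidently

(2.29)  $\operatorname{Cov}(\varepsilon^{\otimes 4}) = \sigma^8 \Big\{ 24 I + 72 \sum_{j,k=1}^{n} (\pi_S I_{jk} \circ \pi_S I_{jk}) \Big\}$,

where $(a \circ b)$ again denotes the outer product:
$(a \circ b) c = \langle b, c \rangle a$.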
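The variance formula (2.27) lends itself to the same kind of exact check as the case $p = 3$. The sketch below is an illustration under an assumption made here: quasi-normal errors are represented by a 5-point Gauss-Hermite distribution, which matches the normal moments up to order 9 and therefore all moments of order 8 that enter. The symmetric tensor `a`, `sigma` and $n = 4$ are arbitrary choices; $n = 4$ lets all four indices be distinct.

```python
import itertools
import math
import numpy as np
from numpy.polynomial.hermite_e import hermegauss

sigma = 1.2
nodes, weights = hermegauss(5)       # weight exp(-x^2/2); exact up to degree 9
values = sigma * nodes
probs = weights / weights.sum()

n = 4
rng = np.random.default_rng(2)
raw = rng.normal(size=(n, n, n, n))
a = sum(np.transpose(raw, perm)
        for perm in itertools.permutations(range(4))) / math.factorial(4)

# Enumerate all 5^n outcomes of the error vector with their probabilities.
states = np.array(list(itertools.product(range(len(values)), repeat=n)))
E = values[states]
P = probs[states].prod(axis=1)

q = np.einsum('ijkl,si,sj,sk,sl->s', a, E, E, E, E)   # <a, e^{(x)4}> per outcome
variance = P @ (q * q) - (P @ q) ** 2

tr_jk = np.einsum('jkaa->jk', a)                      # tr_jk a = sum_alpha a_{jk alpha alpha}
formula = sigma ** 8 * (24 * np.sum(a * a) + 72 * np.sum(tr_jk * tr_jk))
```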

(2.28) can also be written as

(2.30)  $\sigma^8 \big\{ 24 I + 72\, \pi_S \big( I_n \otimes I_n \otimes (\operatorname{vec} I_n)(\operatorname{vec} I_n)' \big) \pi_S \big\}$,

the representation given in Pukelsheim [14]. This representation
will not be used here.

A final remark of this paragraph concerns the covariance-operator
(2.28). This formula is only correct if it is really considered as
a covariance-operator, restricted to the symmetric tensors. It is
not identical with the covariance-matrix. Let us assume we have
computed the covariance-matrix
$E(\varepsilon^{\otimes 2} (\varepsilon^{\otimes 2})') = C$. From (2.4)-(2.6)
we get in the quasi-normal case
$Ca = \sigma^4 \{ (a_{ij}) + (a_{ji}) + (\operatorname{tr} a) I_n \}$. Since
in general $\operatorname{vec}(bb') = b \otimes b$ it follows that
$E(\varepsilon^{\otimes 4}) = \operatorname{vec}(E(\varepsilon^{\otimes 2} (\varepsilon^{\otimes 2})')) = \operatorname{vec}(C)$.
Denote by $e_{ij}$ the vector in $\mathbb{R}^{n^2}$ having 1 at place
$in + j$ and zero elsewhere; then it is easily checked that
$\operatorname{vec}(C)$ is different from
$3\sigma^4 (e_{11}, \ldots, e_{1n}, \ldots, e_{n1}, \ldots, e_{nn})$, but
equality is obtained when the two matrices are applied to symmetric
tensors. In so far the assertion in Pukelsheim [14] claiming that
(2.29) is the covariance-matrix is wrong.


3.  Linear  sufficient  statistics  in  multilinear  estimation 

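We consider the linear model $Ey \in L \subseteq \mathbb{R}^n$,
$\operatorname{Cov} y = \sigma^2 I_n$ as described in section 1 of this
paper. Let $F = L - L$ and $P_F y$ denote the orthogonal projection of
$y$ onto $F$. Then $G_0 y = l + P_F(y - l)$, $l \in L$, is the unique BLUE
of $Ey$. We consider

(3.1)  $u = \sigma^{-1}(y - Ey), \qquad z = (I - P_F)(y - l) = \sigma (I - P_F) u$.

The quantity

(3.2)  $V = zz' = (I - P_F)(y - l)(y - l)'(I - P_F) = \sigma^2 (I - P_F)\, uu'\, (I - P_F)$

is a random element with values in the set $H$ of all symmetric
$n \times n$-matrices $A$ satisfying $Af = 0 \ \forall f \in F$. Let
$M = (I - P_F)$, then $A \in H$ iff $MAM = A$ (Drygas [4]). In $H$ the
inner product $\langle A, B \rangle = \operatorname{tr}(AB)$ is used. Since
$\langle zz', A \rangle = \sigma^2 (u' A u)$ and by (2.1), (2.8)

(3.3)  $E(u' A u) = \operatorname{tr}(A) = \operatorname{tr}(MA)$

(3.4)  $\operatorname{Var}(u' A u) = \operatorname{tr}([2A + (\beta - 3) \operatorname{diag} A] \cdot A) = \operatorname{tr}([2A + (\beta - 3)\, M \cdot \operatorname{diag} A \cdot M] A)$,

we get that $V = zz'$ follows the linear model

(3.5)  $EV = \sigma^2 M, \qquad (\operatorname{Cov} V) A = \sigma^4 \{ 2A + (\beta - 3)\, M (\operatorname{diag} A) M \}$,

if considered as $H$-valued random element.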

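Besides the mapping $\operatorname{Diag} A = (a_{ij} \delta_{ij})$, which is
evidently self-adjoint, we consider the linear mapping
$\operatorname{diag}: \mathbb{R}^n \to \mathbb{R}^{n \times n}$ defined by
$\operatorname{diag}(x_1, \ldots, x_n)' = (\delta_{ij} x_i)$. Evidently the
adjoint is $\operatorname{diag}^*(a_{ij}) = (a_{11}, \ldots, a_{nn})'$. If
$A$ and $B$ are two $n \times n$-matrices then the Hadamard-product
$A * B$ is defined by $(A * B) = (a_{ij} b_{ij})$.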

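3.1 Theorem: Let $M \neq 0$. Then $\operatorname{tr}(MV) = \operatorname{tr}(V)$
is a linearly sufficient statistic in the model (3.5) iff the
Hsu-condition $\sigma^4 (\beta - 3)(M * M) m = \rho m$, where
$\rho = (\beta - 3) \sigma^4 \operatorname{tr}(M * M) / (\operatorname{tr} M)$,
is met. In this case $\operatorname{tr}(MV) = \operatorname{tr} V$ is also
linearly minimal sufficient.

Proof: Consider the linear mapping $AV = \operatorname{tr}(MV)$. This is a
mapping from $H$ to $\mathbb{R}$. The adjoint mapping of this mapping is
$A^* \lambda = \lambda M$, $\lambda \in \mathbb{R}$. $A$ is linearly
sufficient iff

(3.6)  $\{\lambda M\} \subseteq \{\lambda W M\} = \operatorname{im}(WA^*)$,

where

(3.7)  $W = \operatorname{Cov} V + (\operatorname{tr} M)^{-1} (M \circ M)$,

since $(\operatorname{tr} M)^{-1} M \circ M$ is the orthogonal projection onto
$\{\lambda M\}$. This is the case iff

(3.8)  $WM = 2\sigma^4 M + \sigma^4 (\beta - 3)\, M (\operatorname{Diag} M) M + M = pM$

for some $p \neq 0$. This means that
$\sigma^4 (\beta - 3)\, M (\operatorname{Diag} M) M = \alpha M$ for some
$\alpha \in \mathbb{R}$. By taking traces on both sides of the last equation
$\alpha = (\operatorname{tr} M)^{-1} \sigma^4 (\beta - 3) \operatorname{tr}(M \operatorname{Diag} M) = (\operatorname{tr} M)^{-1} \sigma^4 (\beta - 3) \operatorname{tr}(M * M)$,
so that $\alpha$ coincides with the constant $\rho$ of the theorem.
Since $\operatorname{tr}(M * M) \leq \operatorname{tr}(M)$ and $\beta \geq 1$ it
follows that $p = 2\sigma^4 + \alpha + 1 > 0$. But by Hsu's theorem
(Drygas-Hupet [7], Pukelsheim [13], Khatri [9]),
$M \operatorname{diag} x\, M = 0$ is equivalent to $(M * M) x = 0$.
This finishes the proof of the theorem since the assertion
concerning linear minimal sufficiency is now obvious.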
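The Hsu-condition can be examined numerically for a given design. The sketch below is an illustration under assumptions not stated in the text: $F = \operatorname{im}(X)$ for a design matrix `X`, $m$ is read as the vector of diagonal elements of $M$, and the common factor $\sigma^4(\beta - 3)$ is divided out (for $\beta = 3$ the condition holds trivially). It evaluates both the vector form $(M * M) m = \alpha' m$ and the matrix form $M (\operatorname{Diag} M) M = \alpha' M$ with $\alpha' = \operatorname{tr}(M * M)/\operatorname{tr} M$, which Hsu's theorem shows to be equivalent.

```python
import numpy as np


def hsu_condition(X, tol=1e-10):
    """Return (vector form holds, matrix form holds) for M = I - P_F, F = im(X)."""
    n = X.shape[0]
    M = np.eye(n) - X @ np.linalg.pinv(X)    # projection onto the orthocomplement of F
    m = np.diag(M)                           # assumed reading: m = (m_11, ..., m_nn)'
    MM = M * M                               # Hadamard product M * M
    alpha = np.trace(MM) / np.trace(M)       # requires M != 0
    vector_form = np.allclose(MM @ m, alpha * m, atol=tol)
    matrix_form = np.allclose(M @ np.diag(m) @ M, alpha * M, atol=tol)
    return bool(vector_form), bool(matrix_form)


# Hypothetical designs: a common mean, and a regression through the origin.
mean_model = hsu_condition(np.ones((4, 1)))
origin_model = hsu_condition(np.array([[1.0], [2.0], [3.0]]))
```

In the common-mean model all diagonal elements of $M$ are equal and the condition holds; for the regression through the origin with regressor values 1, 2, 3 it fails.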


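3.2 Theorem. Let $M \neq 0$. Then
$\operatorname{diag}^* V = (v_{11}, \ldots, v_{nn})'$ is a linearly
sufficient statistic.

Proof: 1) Since we consider $V$ as an element of $H$,
$\operatorname{diag}^*$ is to be considered as mapping from $H$ to
$\mathbb{R}^n$. The adjoint $((\operatorname{diag})^*)^*$ of this mapping is
not $\operatorname{diag}$ but $M \operatorname{diag} M$, since for $A \in H$

(3.9)  $\operatorname{tr}(M \operatorname{diag} x\, M \cdot A) = \operatorname{tr}(\operatorname{diag} x \cdot A) = x' (\operatorname{diag})^* A$

and $M \operatorname{diag} x\, M \in H$.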

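2) Two cases have to be distinguished. Either there is an
element $A \in H$ such that $(\operatorname{Cov} V) A = M$ or there is an
element $A \in H$ such that $\operatorname{tr}(AM) \neq 0$ and
$(\operatorname{Cov} V) A = 0$. (The latter case can only occur if
$\beta = 1$.) This follows from
$\operatorname{im}(Q) = (Q^{-1}(0))^{\perp}$, if $Q$ is self-adjoint.

In the first case theorem 1.2 tells us that we can choose
$W = \operatorname{Cov} V$, while in the second case $W$ will be chosen
equal to $\operatorname{Cov} V + (\operatorname{tr} M)^{-1} (M \circ M)$. In
both cases, however, $M \in \operatorname{im}(W (\operatorname{diag}^*)^*)$
has to be proved. Let
$M = (\operatorname{Cov} V) A = \sigma^4 \{ 2A + (\beta - 3)\, M (\operatorname{Diag} A) M \}$,
$A \in H$. This implies at first that $\sigma^4$ can't vanish. Therefore

(3.10)  $A = (2\sigma^4)^{-1} M (\operatorname{diag} 1_n - \sigma^4 (\beta - 3) \operatorname{diag} \operatorname{diag}^* A) M = M \operatorname{diag} x\, M$

with $x = (2\sigma^4)^{-1} (1_n - \sigma^4 (\beta - 3) \operatorname{diag}^* A)$,
$1_n = (1, \ldots, 1)'$. Thus
$M = (\operatorname{Cov} V)(\operatorname{diag}^*)^* x$ and
$\operatorname{diag}^*$ is linearly sufficient.

In the second case $(\operatorname{Cov} V) A = 0$,
$WA = (\operatorname{tr} M)^{-1} \operatorname{tr}(MA) M \neq 0$, implying
$WB = M$, $B = (\operatorname{tr}(MA))^{-1} (\operatorname{tr} M) A$ and

(3.11)  $(\operatorname{Cov} V) B = 2\sigma^4 B + \sigma^4 (\beta - 3)\, M \operatorname{diag} \operatorname{diag}^* B\, M = 0$.

Thus if $\sigma \neq 0$,
$B = -2^{-1} (\beta - 3)\, M \operatorname{diag} \operatorname{diag}^* B\, M = M \operatorname{diag} x\, M$,
$x = -2^{-1} (\beta - 3) \operatorname{diag}^* B$. If $\sigma = 0$, then
evidently
$W (\operatorname{diag}^*)^* 1_n = W M \operatorname{diag} 1_n M = M$.
Therefore linear sufficiency is proved in all possible cases, Q.E.D.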

We will now consider

(3.11)  $V_i = z^{\otimes i} = M^{\otimes i} z^{\otimes i}, \quad i = 3, 4$.

Since $M^{\otimes i} z^{\otimes i} = (Mz)^{\otimes i} = \sigma^i M^{\otimes i} u^{\otimes i}$
we can apply the results of paragraph 2 for obtaining expectation and
covariance-operator of $V_i$. First of all, note that $V_i$ is a
symmetric tensor obeying the equation $M^{\otimes i} V_i = V_i$.
Therefore our reference vector-space $H$ will be the set of all
symmetric tensors $a$ meeting the equation $M^{\otimes i} a = a$.


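We introduce the following notation: Let $a \in \mathbb{R}^n$,
$a = (a_1, \ldots, a_n)'$. Then we define
$\operatorname{diag}_1 a = (a_{i_1} \delta_{i_1 i_2} \cdots \delta_{i_1 i_p};\ i_1, \ldots, i_p = 1, 2, \ldots, n) \in \mathbb{R}^{n^p}$.
In general if $a = (a_{i_1 \ldots i_k}) \in \mathbb{R}^{n^k}$ and $p > k$ we
define

(3.12)  $\operatorname{diag}_k(a) = (a_{i_1 \ldots i_k}\, \delta_{i_k i_{k+1}} \cdots \delta_{i_k i_p}) \in \mathbb{R}^{n^p}$.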

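With these definitions we evidently get from (2.12) and (2.13):

(3.13)  $E(z^{\otimes 3}) = \sigma^3 E(u_1^3)\, M^{\otimes 3} \operatorname{diag}_1 1_n, \qquad 1_n = (1, \ldots, 1)'$.

(3.14)  $E(z^{\otimes 4}) = \sigma^4 M^{\otimes 4} \Big\{ (\beta - 3) \operatorname{diag}_1 1_n + 3 \sum_{i=1}^{n} \pi_S I_{ii} \Big\}$.

The covariance-operators, defined as mappings from $H$ to $H$, are
found in the quasi-normal case (use $M^{\otimes i} a = a$!) to be equal to

(3.15)  $\operatorname{Cov}(z^{\otimes 3}) = \sigma^6 \Big\{ 6I + 9 M^{\otimes 3} \sum_{\beta=1}^{n} (\pi_S I_\beta \circ \pi_S I_\beta) \Big\}$,

(3.16)  $\operatorname{Cov}(z^{\otimes 4}) = \sigma^8 \Big\{ 24 I + 72 M^{\otimes 4} \sum_{j,k=1}^{n} \pi_S I_{jk} \circ \pi_S I_{jk} \Big\}$.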


This model has intensively been studied in Pukelsheim [14].
Since the covariance-operator is only computed under quasi-normality
the estimators derived from linear model theory are only locally best
(linear) unbiased estimators. Pukelsheim's investigation was suggested
by a paper by Anscombe [1], who used
$\operatorname{diag}_1^* z^{\otimes 3} = z^{*3}$ and
$\operatorname{diag}_1^* z^{\otimes 4} = z^{*4}$ to obtain estimators of
$E(u_1^3)$ and $E(u_1^4)$, respectively. Pukelsheim showed that these
estimators are not even locally best. Using $z^{\otimes 3}$ and
$z^{\otimes 4}$, respectively, means the consideration of linear
combinations of $z_i z_j z_k$ ($i, j, k = 1, \ldots, n$) and of
$z_i z_j z_k z_l$ ($i, j, k, l = 1, \ldots, n$), respectively.
But we will show now that it is enough to consider only $z_i^2 z_j$
($i, j = 1, \ldots, n$) and $z_i^2 z_j z_k$ ($i, j, k = 1, \ldots, n$),
respectively. Evidently
$(z_i^2 z_j) = \operatorname{diag}_2^* (z_i z_j z_k) = z^{*2} \otimes z$.
Similarly,
$(z_i^2 z_j z_k) = z^{*2} \otimes z^{\otimes 2} = \operatorname{diag}_3^* z^{\otimes 4}$.
We will prove that these statistics are linearly sufficient.
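The action of $\operatorname{diag}_2$ and its adjoint for $p = 3$ can be made concrete with arrays. The sketch below is an illustration only: tensors are stored as NumPy arrays, the function names are hypothetical, and the index convention $(\operatorname{diag}_2 b)_{i_1 i_2 i_3} = b_{i_1 i_2} \delta_{i_2 i_3}$ of (3.12) is used, under which the adjoint applied to $z^{\otimes 3}$ yields the array with entries $z_i z_j^2$, i.e. the products $z_i^2 z_j$ up to the naming of the indices.

```python
import numpy as np


def diag2(b):
    """(diag_2 b)_{ijk} = b_ij * delta_jk, mapping R^{n x n} into R^{n x n x n}."""
    n = b.shape[0]
    return np.einsum('ij,jk->ijk', b, np.eye(n))


def diag2_adjoint(c):
    """Adjoint of diag_2 with respect to the inner product (2.9): c_{ijj}."""
    return np.einsum('ijj->ij', c)


rng = np.random.default_rng(3)
n = 4
b = rng.normal(size=(n, n))
c = rng.normal(size=(n, n, n))
z = rng.normal(size=n)
z3 = np.einsum('i,j,k->ijk', z, z, z)          # the tensor power of z with p = 3

# Defining property of the adjoint: <diag_2 b, c> = <b, diag_2^* c>.
adjoint_ok = np.isclose(np.sum(diag2(b) * c), np.sum(b * diag2_adjoint(c)))
# diag_2^* applied to the tensor power gives the products z_i * z_j^2.
reduced_ok = np.allclose(diag2_adjoint(z3), np.outer(z, z ** 2))
```

The reduced statistic has $n^2$ components instead of the $n^3$ components of $z^{\otimes 3}$.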

3.3 Theorem. a) $z^{*2} \otimes z = \operatorname{diag}_2^* z^{\otimes 3}$ is
linearly sufficient in the model described by (3.13), (3.15).

b) $z^{*2} \otimes z^{\otimes 2} = \operatorname{diag}_3^* z^{\otimes 4}$ is
linearly sufficient in the model described by (3.14), (3.16).

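Proof: a) Since our reference vector-space $H$ is the set of all
symmetric tensors $a$ from $\mathbb{R}^{n^3}$ meeting
$M^{\otimes 3} a = a$, $(\operatorname{diag}_2^*)^*$ has to be a mapping
from $\mathbb{R}^{n^2}$ to $H$. This mapping is

(3.17)  $(\operatorname{diag}_2^*)^* = M^{\otimes 3} \pi_S \operatorname{diag}_2$.

This follows since $M^{\otimes 3} \pi_S \operatorname{diag}_2 b \in H$ and
$\langle M^{\otimes 3} \pi_S \operatorname{diag}_2 b, c \rangle = \langle \operatorname{diag}_2 b, c \rangle = \langle b, \operatorname{diag}_2^* c \rangle$
for all $b \in \mathbb{R}^{n^2}$, $c \in H$.

We firstly deal with the case $\sigma^6 = 0$. Let $M \neq 0$, otherwise
there is no assertion. Let
$W = (M^{\otimes 3} \operatorname{diag}_1 1_n \circ M^{\otimes 3} \operatorname{diag}_1 1_n) \big(\sum_{i,j=1}^{n} m_{ij}^3\big)^{-1}$.
$W$ is the orthogonal projection onto
$F = \{\gamma M^{\otimes 3} \operatorname{diag}_1 1_n\}$. Since
$\operatorname{Cov} V_3 = 0$ and
$W M^{\otimes 3} \operatorname{diag}_1 1_n = M^{\otimes 3} \operatorname{diag}_1 1_n = M^{\otimes 3} \pi_S \operatorname{diag}_2 \operatorname{vec}(I_n) = (\operatorname{diag}_2^*)^* \operatorname{vec}(I_n)$,
$\operatorname{diag}_2^*$ is clearly linearly sufficient.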

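Now let $\sigma^6 \neq 0$, then
$\operatorname{Cov} V_3 = \sigma^6 \{ 6I + 9 M^{\otimes 3} \sum \ldots \}$ is
regular on $H$, since
$\langle (\operatorname{Cov} V_3) a, a \rangle = 6\sigma^6 \langle a, a \rangle + 9\sigma^6 \sum_{j} \big(\sum_{i} a_{iij}\big)^2$
vanishes iff $a = 0$. For this reason there is a tensor $a \in H$ such
that

(3.18)  $M^{\otimes 3} \operatorname{diag}_1 1_n = 6\sigma^6 a + 9\sigma^6 M^{\otimes 3} \sum_{j=1}^{n} (\pi_S I_j \circ \pi_S I_j)\, a$.

Since $W$ can be chosen equal to $\operatorname{Cov} V_3$, our assertion
would be proved if we could show that $a$ has the form
$M^{\otimes 3} \pi_S \operatorname{diag}_2 b$ for some
$b \in \mathbb{R}^{n^2}$. But (3.18) implies that

(3.19)  $a = (6\sigma^6)^{-1} M^{\otimes 3} \Big\{ \operatorname{diag}_1 1_n - 9\sigma^6 \sum_{j=1}^{n} \langle I_j, a \rangle\, \pi_S I_j \Big\} = M^{\otimes 3} \pi_S \operatorname{diag}_2 b$,

where $b = (b_{ij})$ and

(3.20)  $b_{ij} = (6\sigma^6)^{-1} \Big( \delta_{ij} - 9\sigma^6 \sum_{l=1}^{n} a_{lli} \Big)$.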

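b) Again, $(\operatorname{diag}_3^*)^* = M^{\otimes 4} \pi_S \operatorname{diag}_3$
can easily be established. Since
$\sigma^{-8} \langle (\operatorname{Cov} V_4) a, a \rangle = 24 \langle a, a \rangle + 72 \sum_{j,k} \big(\sum_{l} a_{jkll}\big)^2$,
$a \in H$, is positive whenever $a \neq 0$, $\operatorname{Cov} V_4$ is
regular and $W = \operatorname{Cov} V_4$ is a possible choice if
$\sigma^8 \neq 0$ ($\sigma^8 = 0$ can analogously be dealt with as above).
Therefore there is an element $a \in H$ such that

(3.21)  $Wa = M^{\otimes 4} \operatorname{diag}_1 1_n = \sigma^8 \Big\{ 24a + 72 M^{\otimes 4} \sum_{j,k} \langle I_{jk}, a \rangle\, \pi_S I_{jk} \Big\}$

and an element $b \in H$ such that

(3.22)  $Wb = M^{\otimes 4} \sum_{i} \pi_S I_{ii} = \sigma^8 \Big\{ 24b + 72 M^{\otimes 4} \sum_{j,k} \langle I_{jk}, b \rangle\, \pi_S I_{jk} \Big\}$.

These two equations can be rewritten as

(3.23)  $a = (24\sigma^8)^{-1} M^{\otimes 4} \Big\{ \operatorname{diag}_1 1_n - 72\sigma^8 \sum_{j,k} \langle I_{jk}, a \rangle\, \pi_S I_{jk} \Big\} = M^{\otimes 4} \pi_S \operatorname{diag}_3 c_1$,

(3.24)  $b = (24\sigma^8)^{-1} M^{\otimes 4} \Big\{ \sum_{i} \pi_S I_{ii} - 72\sigma^8 \sum_{j,k} \langle I_{jk}, b \rangle\, \pi_S I_{jk} \Big\} = M^{\otimes 4} \pi_S \operatorname{diag}_3 c_2$,

where

(3.25)  $c_1 = \Big( \big( \delta_{ij} \delta_{ik} - 72\sigma^8 \sum_{\gamma} a_{ij\gamma\gamma} \big) / (24\sigma^8) \Big)$,

(3.26)  $c_2 = \Big( \big( \delta_{ij} - 72\sigma^8 \sum_{\gamma} b_{ij\gamma\gamma} \big) / (24\sigma^8) \Big)$.

This shows that $\operatorname{im}(W (\operatorname{diag}_3^*)^*)$ contains
the set of possible expectation-values, Q.E.D.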